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Abstract 

In just three years, the green fluorescent protein (GFP) fromthejellyfishA^Korea 
victoria has vaulted from obscurity to become one of the most widely studied 
and exploited proteins in biochemistry and cell biology. Its amazing ability to 
generate a highly visible, efficiently emitting internal fiuorophorc is both intrin- 
sically fascinating and tremendously valuable. High-resolution crystal structures 
of GFP offer unprecedented opportunities to understand and manipulate the rela- 
tion between protein structure and spectroscopic function. GFP has become well 
established as a marker of gene expression and protein targeting in intact cells 
and organisms. Mutagenesis and engineering of GFP into chimeric proteins are 
opening new vistas in physiological indicators, biosensors, and photochemical 
memories. 
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NATURAL AND SCIENTIFIC HISTORY OF GFP 

Discovery and Major Milestones 

Green Fluorescent Protein was discovered by Shimomura et al (1) as a compan- 
ion protein to aequorin, the famous chemiluminescent protein from Aequorea 
jellyfish. In a footnote to their account of aequorin purification, they noted 
that "a protein giving solutions lhat look slightly greenish in sunlight through 
only yellowish under tungsten lights, and exhibiting a very bright, greenish 
fluorescence in the ultraviolet of a Mineralite, has also been isolated from 
squeezates." This description of the appearance of GFP solutions is still ac- 
curate. The same group (2) soon published the emission spectrum of GFP, 
which peaked at 508 nm. They noted that the green bioluminescence of living 
Aequorea tissue also peaked near this wavelength, whereas the chemilumines- 
cence of pure aequorin was blue and peaked near 470 nm, which was close 
to one of the excitation peaks of GFP. Therefore the GFP converted the blue 
emission of aequorin to the green glow of the intact cells and animals. Morin 
& Hastings (3) found the same color shift in the related coelenterates Obelia 
(a hydroid) and Renilla (a sea pansy) and were the first to suggest radiation- 
less energy transfer as the mechanism for exciting coelenterate GFPs in vivo. 
Morise et al (4) purified and crystallized GFP, measured its absorbance spectrum 
and fluorescence quantum yield, and showed that aequorin could efficiently 
transfer its luminescence energy to GFP when the two were coadsorbed onto a 
cationic support. Prendergast & Mann (5) obtained the first clear estimate for the 
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GREEN FLUORESCENT PROTEIN 511 

monomer molecular weight. Shimomura (6) proteolyzed denatured GFP, ana- 
lyzed the peptide diat retained visible absorbance, and correctly proposed that 
the chromophore is a 4-(p-hydroxybenzylidene)imidazolidin-5-one attached to 
the peptide backbone through the 1- and 2-positions of the ring. 

Aequorea and Renilla GFPs were later shown to have the same chromophore 
(7); and the pH sensitivity, aggregation tendency (8), and renaturation (9) of 
Aequorea GFP were characterized. But the crucial breakthroughs came with 
the cloning of the gene by Prasher et al (10) and the demonstrations by Chalfie 
et al (1 1) and Inouye & Tsuji (12) that expression of the gene in other organisms 
creates fluorescence. Therefore the gene contains all the information necessary 
for the posttranslational synthesis of the chromophore, and no jellyfish-specific 
enzymes are needed. 

Occurrence, Relation to Bioluminescence, 
and Comparison with Other Fluorescent Proteins 
Green fluorescent proteins exist in a variety of coelenterates, both hydrozoa such 
as Aequorea, Obelia, and Phialidium, and anthozoa such as Renilla (3, 13). In 
this review, GFP refers to the Aequorea protein except where another genus 
name is specifically indicated. These GFPs seem to be partners with chemi- 
luminescent proteins and to control the color of the emission in vivo. Despite 
interesting speculations, it remains unclear why these coelenterates glow, why 
green emission should be ecologically so superior to the blue of the primary 
emitters, and why the animals synthesize a separate GFP rather than mutate the 
chemiluminescent protein to shift its wavelengths. Other than Aequorea GFP, 
only Renilla GFP has been biochemically well characterized (14). Despite the 
apparent identity of the core chromophore in Renilla and Aequorea GFP, Re- 
nilla GFP has a much higher extinction coefficient, resistance to pH-induced 
conformational changes and denaturation, and tendency to dimerize (7). 

Unfortunately, Aequorea GFP genes are the only GFP genes that have been 
cloned. Several other bioluminescent species also have emission-shifting ac- 
cessory proteins, but so far the chromophores all seem to be external cofac- 
tors such as lumazines (15) or flavins (16), which diminish their attractive- 
ness as biotechnological tags and probes. Likewise phycobiliproteins (17) and 
peridinin-chlorophyH-a protein (18), which are highly fluorescent and attrac- 
tively long-wavelength accessory pigments in photosynthesis, use tetrapyrrole 
cofactors as their pigments. Correct insertion of the cofactors into the apopro- 
teins has not been demonstrated in foreign organisms, so these proteins are 
not ready to compete with Aequorea GFP. A variety of marine organisms fluo- 
resce, but the biochemistry of the fiuorophores is almost completely unknown. 
Painstaking research like that undertaken by the pioneers of Aequorea and 
Renilla GFP would be needed before cloning efforts could begin. It is unclear 
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whether any investigators or granting agencies are still patient enough to un- 
dertake and fund such long-term groundwork. 

PRIMARY, SECONDARY, TERTIARY, 
AND QUATERNARY STRUCTURE 

Primary Sequence from Cloning 

The sequence of wild-type Aequorea GFP (10) is given in Figure 1. Sequences 
of at least four other isoforms are known (19), though none of the mutations seem 
to be in positions known to influence protein behavior. Most cDNA constructs 
derived from the original sequence contain the innocuous mutation Q80R, prob- 
ably resulting from a PCR error (11). Also, the gene has been resynthesized 
with altered codons and improved translational initiation sequences (see section 
on "Promoters, Codon Usage, and Splicing"). 

The chromophore is ap-hydroxybenzylideneimidazolinone (10, 20) formed 
from residues 65-67, which are Ser-Tyr-Gly in the native protein. Figure 2 
shows the currently accepted mechanism (21-23) for chromophore forma- 
tion. First, GFP folds into a nearly native conformation, then the imidazoli- 
none is formed by nucleophilic attack of the amide of Gly67 on the carbonyl 
of residue 65, followed by dehydration. Finally, molecular oxygen dehydra- 
genates the a-p bond of residue 66 to put its aromatic group into conjugation 
with the imidazolinone. Only at this stage does the chromophore acquire vis- 
ible absorbance and fluorescence. This mechanism is based on the following 
arguments: (a) Atmospheric oxygen is required for fluorescence to develop 
(21, 24). (b) Fluorescence of anaerobically preformed GFP develops with a 
simple exponential time course after air is readmitted (21, 25), which is essen- 
tially unaffected by the concentration of the GFP itself or of cellular cofactors. 
(c) Analogous imidazolinones autoxidize spontaneously (26). (d ) The proposed 



Figure J GFP sequences. (Line 1) The wild-type (WT) gfplO gene as originally cloned and 
sequenced by Prasher et al (10). (Line 2) A popular humanized version (EGFP, Clontech Labora- 
tories. Palo Alto, CA) (64) incorporating (a) an optimal sequence for translational initiation (66) 
including insertion of a new codon GTG; (6) mu tation of Phe64 to Leu to improve folding at 37° C; 
(c) mutation of Ser65 to Thr to promote chromophore ionization; and (d) mutation of His231 to 
Leu. which was probably inadvertent and neutral. (Line 3) WT amino acid sequence. (Lines 4 
and 5) Numbering of amino acids and differences between EGFP and WT. The inserted Vat is 
numbered la to maintain correspondence with the WT numbering. Note that constructs derived 
from the natural gfplO cDNA contain an apparently neutral mutation Gln80 -> Arg (Q80R) caused 
by a PCR error that changed the CAG codon loCGG(ll). In some genes artifici ally resynthesized 
with different codons, this error was corrected (e.g. 63 and Clontech's EGFP) but was left as an 
arginine codon in other instances (40,62). 
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ATG AGT AAA GGA GAA OAA CTT TIC ACT GGA QTT GTC CCA ATT CTT GTT GAA TTA GAT GOT 60 

ATG GTG AGO AAG GGC GAG GAG CTG TTC ACC GGG GIG GTG CCC ATC CTG GTC GAG CTG GAC GGC 
Mat Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly 
1 Val S 10 IS 20 

(la) 

GAT GTT RAT GGG CAC AAA TTT TCT GTC AGT GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA 1ZO 
GAC GTA AAC GGC CAC AAG TTC ACC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC 
Asp Val Asn Gly His Lys Phe ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
25 30 35 40 

AAA CTT ACC CTT AAA TTT ATT TGC ACT ACT GGA AAA CTA CCT GTT CCA TGG CCA ACA CTT 160 
AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC 
LyB Leu Thr Leu Lya Phe lie Cye Thr Thr Gly Lya Leu Pro Val Pro Trp Pro Thr Leu 
« 50 55 60 

GTC ACT ACT TTC TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATC AAA CAG 240 

GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGO CGC TAC CCC GAC CAC ATG AAG CAG 

Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Hot Lya Gin 
Leu Thr 70 75 80 
65 

CAT GAC TTT TTC AAG AGT CCC ATG CCC GAA GGT TAT GTA CAG GAA AGA ACT ATA TTT TTC 300 

CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC 

His Aap Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe 
SS 90 95 100 

AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT 360 
AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG CTC AAC TTC GAG GGC GAC ACC CTG GTG 

Lya Asp Aap Gly Aan Tyr Lya Thr Arg Ala olu Val Lya Phe Glu Gly Aap Thr Leu Val 
103 110 lis 120 

AAT AGA ATC CAC TTA AAA GGT ATT CAT TTT AAA GAA GAT GCA AAC ATT CTT GGA CAC AAA 420 
AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC RAG CAG GAC GGC AAC ATC CTG GGG CAC AAG 
Asn Arg lie Glu Leu Lys Gly lie Aap Phe Lys Glu Asp Gly Asn He Leu Gly His Lya 
125 130 i3 5 140 

TTG GAA TAC AAC TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GCA 480 
CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC 
Leu Glu Tyr Asn Tyr Aan Ser His Asn Val Tyr He Met Ala Asp Lya Gin Lys Asn Gly 
145 150 155 160 

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT CAA CTA GCA GAC 540 

ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC 

He Lys Val Aan Phe Lys He Arg His Asn lie Glu Asp Gly ser Val Gin Leu Ala Aap 
163 170 175 180 

CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC 600 
CAC TAC CAG CAG AAC ACC CCC ATC CGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC 
His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tvr 

185 190 195 200 

CTG TCC ACA CAA TCT GCC CTT TCG AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT 660 
CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG 
Leu Ser Thr Gin Ser Ala Leu Ser Lya Aap Pro Asn Glu Lys Arg Asp His Met Val Leu 

205 210 215 220 

CM GAG r£ IS ~ !™ "* ftCA CAT ° GC AT ° GM GM C ™ TAC AAA TAA TAA 720 

CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 
Leu Glu Phe Val Thr Ala Ala Gly He Thr Bis Gly Met Asp Glu Leu Tyr Lys stop 
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figure 2 Mechanism proposed by Cubitt et a] (22) for the intramolecular biosynthesis of the GFP 
chromophore, with rate constants estimated for the Ser65 -> Thr mutant by Reid & Flynn (23) and 
Heiinetal(25). 



cyclization is isosteric with the known tendency for Asn-Gly sequences to cy- 
clize to imides (27). Glycine is by far the best nucleophile in such cyclizations 
because of its minimal steric hindrance, and GIy67 is conserved in all known 
mutants of GFP that retain fluorescence. («?) Blectrospray mass spectra indi- 
cate that anaerobically preformed GFP loses only 1 ± 4 Da upon exposure 
to air, consistent with the predicted loss of two hydrogens (22). This implies 
that the dehydration (-18 Da) must already have occurred anaerobically and 
must precede oxidation. (/) Reid & Flynn (23) have extensively characterized 
the kinetics of in vitro refolding of GFP from bacterial inclusion bodies with 
no chromophore, urea-denatured protein with a mature chromophore, and de- 
natured protein with a chromophore reduced by dithionite. Renaturation was 
measured by development of fluorescence and resistance to trypsin attack. Their 
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GREEN FLUORESCENT PROTEIN 515 

results support the sequential mechanism and provide the rate constants shown 
in Figure 2. However, many other aspects of the maturation mechanism re- 
main obscure, such as the steric and catalytic roles of neighboring residues, the 
means by which mutations can improve folding efficiency, and the dependence 
of oxidation rate on the oxygen concentration and the protein sequence. 

One predicted consequence of oxidation by 0 2 is that hydrogen peroxide, 
H 2 0 2 , is presumably released in 1:1 stoichiometry with mature GFP. This 
byproduct might explain occasions when high-level expression of GFP can 
be deleterious. Perhaps catalase could be useful in such cases. Some difficult 
GFPs seem to express most readily when targeted to peroxisomes and mitochon- 
dria (28; R Rizzuto, T Pozzan, personal communication). Is it a coincidence 
that these organelles are the best at coping with reactive oxygen species? 

Crystal Structures; Tolerance of Truncations 
Although GFP was first crystallized in 1974 (4) and diffraction patterns re- 
ported in 1988 (29), the structure was first solved in 1996 independently by 
Ormo et al (30), Protein Data Bank accession number IBM A, and by Yang et al 
(31), accession number 1GFL. Both groups relied primarily on multiple anoma- 
lous dispersion of selenomethionine groups to obtain phasing information from 
recombinant protein. Subsequent structures of other crystal forms and mutants 
(32-34a) have been solved by molecular replacement from the 1EMA coordi- 
nates. GFP is an 11-stranded /3-barreI threaded by an a-helix running up the 
axis of the cylinder (Figure 3). The chromophore is attached to the or-helix and 
is buried almost perfectly in the center of the cylinder, which has been called a 
/3-can (31, 34a). Almost all the primary sequence is used to build the ^-barrel 
and axial helix, so that there are no obvious places where one could design large 
deletions and reduce the size of the protein by a significant fraction. Residues 
1 and 230-238 were too disordered to be resolved; these regions correspond 
closely to the maximal known amino- and carboxyl- terminal deletions that still 
permit fluorescence to develop (35). A surprising number of polar groups and 
structured water molecules are buried adjacent to the chromophore (Figure 4). 
Particularly importantare Gln69, Arg96, Hisl48, Thr203, Ser205, and GIu222. 

Dimerization 

The excitation spectrum of wild-type GFP changes its shape as a function of 
protein concentration, implying some form of aggregation (8). The spectro- 
scopic effects of such aggregation are discussed in the section on "Absorbance 
and Fluorescence Properties." In the Yang et al structure for wild-type GFP 
(31), the GFP is dimeric. The dimer interface includes hydrophobic residues 
Ala206, Leu221, and Phe 223 as well as hydrophilic contacts involving Tyr39, 
Glul42, Asn 144, Serl47, Asnl49, Tyrl51, Argl68, Asnl70, Glul72, Tyr200,' 



MAY. 19. 2004 9:35AM Mail I N FOG AT E- 1 E INFO EXP 

EXHIBIT A 



NO. 7 9 5 1 P. 9/37 




MAY. 19. 2004 9:35AM Mail INFOGATE-IE INFO EXP NO. 7951 P. 10/37 

EXHIBIT A 



GREEN FLUORESCENT PROTEIN 5 1 7 



R96 




Figure 4 Amino acid side chains, main chain carbonyls and amides, and solvent waters in the 
immediate vicinity of the chromophore of S65T GFP (30). Side chains are labeled with the one- 
letter code for the amino acid and the residue number. Main chain groups are labeled with the 
residue number. Water oxygens are denoted by W and the corresponding serial number within 
Protein Data Bank structure 1EMA. Probable hydrogen bonds are shown as dotted lines labeled 
with the distance between the heteroatoms in angstroms. Obviously the true three-dimensional 
relationships cannot be depicted accurately in this two-dimensional schematic. Figure courtesy of 
SJ Remington, University of Oregon. 



Ser202, Gln204, and Ser208. However, the same wild-type GFP could also 
crystallize as a monomer (32), isomorphous to the monomeric crystals formed 
by the S65T mutant (30). Even though GFP can hardly be more concentrated 
than in a crystal, the formation of dimers seems to be highly dependent on 
crystal growth conditions rather than an obligatory feature of GFP (33). The 
dissociation constant for the homodimer has been estimated as 100 /jM (34a). 
By contrast, Renilla GFP is an obligate dimer, which is dissociated only under 
denaturating conditions (14). 
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ABSORB ANCE AND FLUORESCENCE PROPERTIES 
Classification ofGFPs 

The currently known GFP variants may be divided into seven classes based on 
the distinctive component of their chromophores: class 1, wild-type mixture of 
neutral phenol and anionic phenolate; class 2, phenolate anion; class 3, neutral 
phenol; class 4, phenolate anion with s tacked jr -electron system ; class 5, indole; 
class 6, imidazole; and class 7, phenyl. Each class has a distinct set of excitation 
and emission wavelengths (Table 1). Classes \-A are derived from polypeptides 
with Tyr at position 66, whereas classes 5-7 result from Trp, His, and Phe at 
that position. Structures of the resulting chromophores are shown in Figure 5, 
together with typical fluorescence spectra. 

WILD-TYPE MIXTURE OF NEUTRAL PHENOL AND ANIONIC PHENOLATE (CLASS 1) 
The wild-type Aequorea protein has the most complex spectra of all the GFPs. 
It has a major excitation peak at 395 nm that is about three times higher in 
amplitude than a minor peak at 475 nm. In normal solution, excitation at 
395 nm gives emission peaking at 508 nm, whereas excitation at 475 nm 
gives a maximum at 503 nm (21). The fact that the emission maximum de- 
pends on the excitation wavelength indicates that the population includes at 
least two chemically distinct species, which do not fully equilibrate within the 
lifetime of the excited state. At pH 10-11, when the protein is on the verge 
of unfolding, increasing pH increases the amplitude of the 475-nm absorbance 
or excitation peak at the expense of the 395-nm peak (8). The simplest in- 
terpretation is that the 475-nm peak arises from GFP molecules containining 
deprotonated or anionic chromophores, whereas the 395-nm peak represents 
GFPs containing protonatcd or neutral chromophores (21, 22). The latter would 
be expected to deprotonate in the excited state, because phenols almost always 
become much more acidic in their excited states. Light-induced ionization to 
the anion would explain why excitation of the neutral chromophores gives 
emission at greater than 500 nm, similar to but not quite identical to the di- 
rect excitation of anionic chromophores. Picosecond spectroscopy gives direct 
evidence for such excited-state proton transfer (36). After a flash at 395 nm, 
the emission shifts from a 460- to a 508-nm peak over about 10 ps. These 
kinetics can be slowed greatly by cooling to 77°K and increasing viscosity, 
or by deuterium substitution, which argue strongly for excited-state proton 
transfer. 

During most light absorption/emission cycles, the proton transfer eventually 
reverses. However, occasionally the proton does not return to the chromophore, 
so the neutral chromophore is photoisomerized to the anionic form. Thus on 
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Ifrble 1 Spectral characteristics of the major classes of green fluorescent proteins (GFPs) 



Mutation 3 




WO" 




#37°C 


References 6 


Class 1, wild-type 














Wild type 


395-397 (25-30) 


504(0.79) 


6 


43,45 






470-475 (9.5-14) 




F99S, M153T, V163A 


Cycle 3 


397(30) 


506 (0.79) 


100 


43,45 






475(6.5-8.5) 






Class 2, phenolate anion 












S65T 




4S9 (52-58) 


509-511(0.64) 






F64L, S65T 


EGFP 


m (55-57) 








F64L ( S65T,V163A 




4KB (42) 


511 (0.58) 






S65T,S72A,NI49K, 




487 (57.5) 


509 (0 68) 






MI53T.1I67T 










Class 3, neutral phenol 












S202F, T203I 


H9 


399 (20) 
399 (29) 


511 (0.60) 




44 


T203I, S72A, YI45F 


H9-40 








Class 4, phenolate anion with sta 


eked jr-eleclron system (yellow Hut 








S65G, S72A.T203F 




512 (65.5) 


522 (0.70) 


6 


U 


S65G, S72A, T203H 




SOS (48.5) 


518 (0.78) 






S65G. V6SL, Q69K 


1QCQ69K 


516(62) 


529 (0.71) 


50 


44 


S72A, T203Y 










S65G,V6SL.S72A,T203Y 


IOC 


514(83.4) 


527 (0.61) 


58 


44 


S65G. S72A, K79R, 
T203Y 


Topaz 


514(94.5) 


527 (0.60) 


100 


44 


Class 5, indole in chromophore (cyan fluorescein proteins) 








Y66W 




436 






21 


Y66W, N146I, M153T, 


W7 


434(23.9) 


476 (0.42) 


61 


44 


V1S3A 




452 


505 






F64L, S65T, Y66W, 


W1B or 


434 (32.5) 
452 


476 (0.4) 
505 


HO 


44 


N146r,M153T,V163A 


ECFP 






S65A, Y66W. S72A, 


wtc 


435 (21.2) 


+95 (0.39) 


100 


44 


N146I, M153T.V163A 










Class 6, imidazole in chromophoi 


re (bine fluore 


aceni proteins) 








Y66H 


BFP 


384(21) 


448 (0.24) 






Y66H. Y145F 


P4-3 


382 (22,3) 


446(0.3) 


52 


44 


F54L, Y66H, Y145F 


KUFP 


380-383 (26.3-31) 


440-447 


100 


43,44 








(0.17-O.26) 




Class 7, phenyl in chromophore 












Y66F 




360 


442 




22 



'Substitutions from the primary sequence of GFP (see Figure 1) are given ns the single-letter code for the amino 
acid being replaced, its numerical position in (he sequence, and the single-letter code for the replacement Note 
that many valuable mutants have been left out of Ihis table for reasons of brevity and because quantitative spectral 



ed GFPs is the 



;ss of properly m 



.uiu uiijjiiLiicas uuia were ikk avauaoie; mereiore omission does not imply denigration. Pheiiotypicallv 
substitutions such as QS0R, H23 1L, and insertion of residue 1 a (see Figure I) have also been omitted 

A M( .is the peak of the excitation spectrum inunits of nanometers, e in parentheses is the absorbance extinction 
coefflcientin units of 1 0 3 M" 1 cm" 1 . Estimates of extinction coefficients have tended toincrease as expression and 
purification are optimized; obsolete older values have been omitted. Two numbers separated by a dash indicate a 
range of esiimates from different authors working under slightly different conditions Two numbers on senarate 
lines indicate two distinct peaks in the excitation spectrum. 

''.>.,.„. is the penkof theomissioiispectniminuniis ofnanom ., . 

yield, which is dimensionless. The best figure of merit for the overall brightne 
product of e and QY. See footnote b for explanation of pairs of values. 

^Relative fluorescence intensities for proteins expressed in Escherichia coii at 37"C from the same vector 
background under similar conditions. These numbers include not only ihe intrinsic brightnesses measured bv 
6 QY but "to ,he foldin e efficiencies st 37«C They are only rough climates, which will change under different 
expression conditions. They have been arbitrarily normalized to 1 00 for ihe brightest member of each class and 
cannot be used to compare different classes. 

References only for the quantitative spectral and brightness data. References to the origin and use of the 
mutants have been omitted for lack of space. 
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intense UV illumination, the 395-nm absorbance and excitation peak of the 
neutral form gradually declines and the 470-nm peak of the chromophore anion 
increases (22, 36, 37). Before illumination, wild-type GFP contains about a 6:1 
ratio of neutral-to-anionic forms, but with enough UV the percentage of anionic 
form can increase several-fold. The probable mechanism (32, 33) is that proton 
transfer occurs via the hydrogen bonds of a buried water and Ser205 to Glu222. 
Meanwhile the side chain of Thr203 rotates to solvate and stabilize thephenolate 
oxyanion. In the crystal structure of monomeric wild-type GFP, Thr203 exists 
in two conformations: approximately 85% with the OH facing away from the 
phenol oxygen, and 15% with the OH rotated toward it (32). This proportion 
agrees well with the spectroscopic estimate for the ratio of neutral to anionic 
chromophores at equilibrium (36). 

Wild-type GFP folds fairly efficiently when expressed at or below room 
temperature, but its folding efficiency declines steeply at higher temperatures. 
Presumably this natural temperature sensitivity is of no consequence to the 
jellyfish, which would never encounter warm water in the Pacific Northwest. 
Temperature sensitivity is restricted to the folding process. GFP that has 
matured properly at low temperature is stable and fluorescent at temperatures 
up to at least 65°C. The poor ability of GFP to mature in warm tempera- 
tures has been used in pulse-chase experiments in which the fate of fluores- 
cent protein made at low temperatures is followed after restoration of normal 
warmth and simultaneous suppression of new fluorescence (38, 39). However, 
for other applications it would be desirable to have a GFP that works well at 
37 a C. 

The most extensive attempt to develop such a mutant while preserving the 
complex wild-type spectrum utilized DNA shuffling (40), a technique for re- 
combining various mutations while creating new ones. This approach produced 
a triple mutant, F99S, M153T, V163A, which improved 37°C-folding, reduced 
aggregation at high concentrations, and increased the diffusibility of the protein 
inside cells (37). The latter two mutations had already been found by more 
conventional mutagenesis procedures (41,42). Such folding mutations do not 



Figure 5 Fluorescence excitation and emission spectra (solid and dashed lines, respectively) for 
typical members of the six major classes of GFP mutants, together with the chromophore structures 
believed to be responsible for the spectra Spectra have been normalized to a maximum amplitude 
of 1. For comparison of absolute brightnesses, see the extinction coefficients and quantum yields 
m Table 1. When only one structure is drawn, both excitation and emission spectra arise from the 
same state of chromophore protonation. The actual GFPs depicted are (a) wild-type (b) Emerald 
(c) H9-40, (d) Topaz, (e) W1B, and (/) P4-3. The detailed substitutions within each of these 
variants are listed in Table 1. 
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increase the intrinsic brightness of properly matured GFP molecules. Such 
brightness is measured by the product of extinction coefficient and fluorescence 
quantum yield (Table 1). The folding mutations merely increase the percentage 
of molecules that mature properly under adverse conditions, such as 37°C and 
high GFP concentrations that promote aggregation (33, 43^5). Although the 
folding mutations are highly valuable and should be incorporated routinely into 
new constructs, they produce less dramatic or no improvements at lower tem- 
peratures and levels of expression. Also the increments in brightness achieved 
by compounding such mutations will be limited by the obvious fact that folding 
cannot exceed 100% efficiency. 

The coexistence of neutral and anion chromophores giving two excitation 
peaks in the wild-type spectrum has a few advantages and many disadvantages 
for cell biological applications. If die GFP fluorescence is to be detected by the 
naked eye, UV excitation is convenient (40) because UV is inherently invisible. 
However, because intense UV can damage the eye, an external excitation- 
blocking filter would be advisable even for visual inspection. Also, scattering, 
autofluorescence, and the possibility of tissue damage are more severe with UV 
excitation. Excitation at the 470-nm peak would reduce these problems but is 
inefficient because only 15% of the protein has the anionic chromophore that 
absorbs there. The photoisomerization is a major hindrance to quantitation of 
images, but it also permits the diffusion or trafficking of GFP-labeled proteins 
to be monitored by locally irradiating a cell with a point or stripe of intense 
UV and then imaging the subsequent fate of the photoisomerized protein with 
470-nm excitation (37). 

PHENOLatb ANION in CHROMOPHORE (CLASS 2) GFPs with phenolate anions 
in the chromophore have become the most widely used class for routine cell 
biological use because they were the first group to combine high brightness with 
simple excitation and emission spectra peaking at wavelengths very similar to 
fluorescein, the most popular small-molecule fluorophore. The most commonly 
used mutation to cause ionization of the phenol of the chromophore is a replace- 
ment of Ser65 by Thr, or S65T (25), though several other aliphatic residues such 
as Gly, Ala, Cys, and Leu have roughly similar effects (25, 46, 47). The triple 
mutation F64M, S65G, Q69L, found by random mutagenesis around the chro- 
mophore, has achieved considerable popularity under the name RSGFP4 (46). 
In both S65T and RSGFP4, the wild-type 395-nm excitation peak due to the 
neutral phenol is suppressed, and the 470- to 475-nm peak due to the anion is 
enhanced five- to sixfold in amplitude and shifted to 489-490 nm (25, 46, 48), 
The oxidation to the mature fluorophore was about fourfold faster in S65T 
than in the wild type (25). Like wild-type GFP, S65T folds fairly efficiently 



MAY. 19. 2004 9:35AM Mail INFOGATE-IE INFO EXP NO. 7951 P. 16/37 

EXHIBIT A 



GREEN FLUORESCENT PROTEIN 523 

when expressed at room temperature or below but tends to misfold and pro- 
duce mostly nonfluorescent aggregates at higher temperatures. Because of the 
obvious interest in expression at 37°C, much effort has been devoted to finding 
additional mutations that give greater brightness at warmer temperatures. The 
most often used of these have been F64L (47) and VI 63 A (42), though other 
mutations such as S72A, NI49K, M153T, and I167T (33) (Table 1) can also 
be helpful alone or in combination. As with GFPs of wild-type spectra, such 
mutations improve only folding efficiency, not the brightness of properly folded 
molecules. 

The probable mechanism by which replacement of Ser65 promotes chro- 
mophore ionization (30, 32) is that only Ser65 can donate a hydrogen bond to 
the buried side chain of Glu222 to allow ionization of that carboxylate, which 
is within 3.7 A of the chroinophore. Gly, Ala, and Leu cannot donate hydrogen 
bonds, and Thr and Cys are too large to adopt the correct conformation in the 
crowded interior of the protein. Such residues at position 65 force the carboxyl 
of GIu222 to remain neutral. The other polar groups solvating the chromophore 
are then sufficient to promote its ionization to an anion, whereas if Glu222 is 
an anion, electrostatic repulsion forbids the chromophore from becoming an 
anion as well. This hypothesis explains why mutation of Glu222 to Gly gives 
the same spectral shape and wavelengths (49) as Ser65 mutations. However, 
practical applications of E222G have not been reported. 

neutral phenol in chromophore (class 3) Ionization of the chromophore 
cannot only be favored but. be repressed. Mutation of Thr203 to lie (21,49) 
largely suppresses the 475-nm excitation peak, leaving only the shorter wave- 
length peak at 399 nm. Presumably a chromophore anion cannot be adequately 
solvated once the OH of Thr203 is gone, so the chromophore is neutral in 
almost all the ground-state molecules. However, die emission is still at 511 
nm because the excited state remains acidic enough to eject a proton. This 
mutant and its folding-optimized descendants (Table 1) could be valuable alter- 
natives for UV-excited green fluorescence without the complicated photochem- 
istry of wild-type (class 1) GFPs. Because they lack an excitation maximum 
near 479-490 nm, they could be used in conjunction with the phenolate anion 
(class 2) GFPs for double-labeling. Images taken with the two different exci- 
tation bands near 400 and 480 nm but the same greater-than-500-nm emission 
would be compared. Even though the spectral contrast between the two GFPs 
would not be as great as when both excitation and emission wavelengths are 
varied, the use of two excitation wavelengths fits the many imaging systems 
designed for excitation-ratioing indicators and avoids any image registration 
problems created by alternating emission filters. The neutral phenol GFPs also 
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have the largest gap in wavelengths between excitation and emission peaks of 
any of the GFPs. This large Stokes' shift could be advantageous in supporting 
laser action, where it is important that the dye should have as little absorbance 
as possible at the wavelengths of fluorescence and lasing. 

PHENOLATE ANION WITH STACKED n -ELECTRON SYSTEM (CLASS 4) The 
longest wavelengths currently available by mutation result from stacking an 
aromatic ring next to the phenolate anion of the chromophore. So far the aro- 
matic ring has always come from the side chain of residue 203, and residue 
65 is Gly or Thr instead of Ser, to promote ionization of the chromophore. 
All four aromatic residues at that position 203 (His, Trp, Phe, and Tyr) in- 
crease the excitation and emission wavelengths by up to 20 rnn, with the shifts 
increasing in the stated order (30), These mutants were rationally designed 
from the crystal structure of S65T GFP in the expectation that the additional 
polarizability around the chromophore and n-n interaction would reduce the 
excited state energy, that is, increase both the excitation and emission wave- 
lengths. The mutants would have been nearly impossible to find by random 
mutagenesis, because all three bases in the original codon (ACA) encoding 
Thr203 would have to be replaced to encode an aromatic amino acid, and any 
random mutation rate high enough to give a significant probability of changing 
three bases in one codon would mutate so many other residues as to kill the 
protein. 

The actual crystal structure of a mutant containing Tyr203 has verified that 
its aromatic ring stacks next to the chromophore (RM Wachter, GT Hanson, 
AB Cubitt, K Kallio, RY Tsien, SJ Remington, manuscript in preparation). 
Mutation of Gln69 to Lys (Q69K) gives an additional shift of about 1-2 nm, 
resulting in an emission peak around 529 nm, the longest now known (Table 1 )'. 
Although 529 nm itself is rather greenish, the tail at longer wavelengths is 
sufficient to give the fluorescence an overall yellowish appearance, which is 
clearly distinguishable by eye from the more greenish emission of GFP classes 
1-3. Therefore class 4 GFPs have been called YFPs for yellowish fluorescent 
proteins (50), though this name has also been used for a fluorescent protein from 
Vibrio fischeri (16). The so-called Bio Yellow variety marketed by Pharmingen 
(51) is identical to RSGFP4 (46), a class 2 GFP with emission maximum at 
505 nm, the same as the wild type. 

INDOLE IN CHROMOPHORE DERIVED FROM Y66W (CLASS 5) Substitution of 
Trp for Tyr66 produces a new chromophore with an indole instead of a phenol 
or phenolate (21). Excitation and emission wavelengths are 436 and 476 nm, 
intermediate between neutral phenol and anionic phenolate chromophores. The 
increased bulk of the indole requires many additional mutations to restore 
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reasonable brightness (41), but when such mutations are provided, the over- 
all performance is fairly good (Table 1). These proteins are called cyan fluo- 
rescent proteins, or CFPs, because of their blue-green or cyan emission. One 
curious and so far unexplained feature (Figure 5) is that most have double- 
humped rather than conventional single excitation and emission peaks. The 
origin of the doubled emission peaks must be vibrational levels or other quan- 
tum states that equilibrate within the lifetime of the excited state, because 
their shapes and relative amplitudes are the same regardless of the excitation 
wavelength. 

IMIDAZOLE IN CHROMOPHORE DERIVED FROM Y66H (CLASS 6) Substitution of 
His for Tyr 66 puts an imidazole in the chromophore (21) and shifts the wave- 
lengths yet shorter than Trp66. The excitation and emission peaks are around 
383 and 447 nm (Table 1), so the emission is blue. A convenient abbreviation is 
therefore BFP, although other blue fluorescent proteins [e.g. spent aequorin and 
lumazine-containing proteins from Photohacterium phosphoreum (15)] have 
previously shared the same acronym. Crystal structures for several BFPs have 
been solved (33, 34). As usual, these proteins benefit considerably from fold- 
ing mutations (33, 41). BFP and a UV-excitable GFP permit double-labeling of 
cellular structures with two emission colors arising from a common excitation 
wavelength near 390 nm (28). However, even with folding improvements BFP 
still suffers from a relatively low fluorescence quantum yield and relatively 
easy bleaching (28). A functional dye laser has been constructed with purified 
BFP as the gain medium, but the duration of lasing at 450 nm is limited by 
protein bleaching (SJ Remington, D Alavi, M Raymer, RY Tsien, manuscript 
in preparation). 

PHENYL IN CHROMOPHORE DERIVED PROM Y66F (CLASS 7) The very shortest 
wavelengths are obtained with Phe at 66 (22). This mutant has been little in- 
vestigated because no obvious practical use for proteins requiring such short 
wavelength excitation has been proposed. Nevertheless it proves that any aro- 
matic residue at position 66 can form a chromophore. 

General Relation of Structure to Spectra 

The denatured wild- type protein absorbs maximally at 384 nm at neutral or 
acidic P H and at 448 nm at alkaline pH, with a pK a of 8.1 (7). This rough 
similarity to the absorbance and excitation maxima of the intact protein was 
a primary motivation for assigning the 395- and 470-nm excitation peaks of 
the latter to the neutral and anionic chromophores. Denatured GFPs or small 
proteolytic fragments carrying the chromophore are essentially totally nonfluo- 
rescent, presumably because the chromophore is unprotected from quench- 
ing by jostling water dipoles, paramagnetic oxygen molecules, or cis-trans 
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isomerization (7, 52). The slight difference in absorbance wavelengths between 
denatured and intact proteins is not unreasonable for the structured environment 
of the latter. In particular, Arg96 puts a positively charged guanidinium quite 
close to the carbonyl group of the imidazolinone. This cation would electro- 
statically stabilize increased electron-density on the carbonyl oxygen in the 
chromophore's excited state. This electrostatic attraction would explain much 
of the red shift of intact protein relative to denatured protein. Indeed, mutation 
of Arg96 to Cys in S65T blue-shifts the excitation maximum from 489 to 472 
nm and the emission maximum from 51 1 to 503 nm (R Ranganathan, personal 
communication), supporting a major role for Arg96 in lowering the energy of the 
excited state. Theoretical calculations of the energy levels of the chromophore 
in vacuo have led to the proposal that the imidazolinone-ring nitrogen adjacent 
to the hydroxy benzylidene must be protonated (53). However, the large effects 
on the chromophore of buried water molecules and the microenvironment sup- 
plied by the protein (52) would seem to provide a chemically more plausible 
explanation. 

Two -Photon Excitation 

One of the most promising new techniques in high-resolution fluorescence mi- 
croscopy is two-photon excitation (54), in which two infrared photons hit a 
fluorophore within a few femtoseconds of each other and sum their energy to 
simulate a single photon of half the wavelength, that is, ultraviolet to blue. Such 
coincidence of infrared photons requires extremely high fluxes and therefore 
occurs to a significant extent only at the focus of a microscope objective of high 
numerical aperture, illuminated by a pulsed laser. Because other regions of the 
specimen are effectively not excited, they neither emit fluorescence nor are sub- 
ject to photobleaching or photodynamic damage. As in confocal microscopy, 
the image is built up by scanning the focus point in a raster, but unlike confo- 
cal microscopy, out-of-focus planes are protected from bleaching, which is a 
tremendous advantage for two-photon excitation. 

GFPs are quite good fluorophores for two-photon excitation. Wild-type GFP 
is readily excited with 780- to 800-nm pulses (43, 54-56), which are in the 
optimal output range for commercial mode-locked titanium-sapphire lasers. 
However, the photoisomerization proceeds just as with 390- to 400-nm single- 
photon excitation (55). Class 3 (neutral phenol) GFP mutants have not been 
tried but should be better because they disfavor photoisomerization. S65T, the 
prototypic class 2 GFP mutant, is optimally excited near 910 nm and has a 
slightly higher two-photon cross-section than the wild type (54). Two-photon 
excitation is also effective on class 6 (imidazole) blue mutants (43) as well 
as class 5 (indole) cyan mutants (H Fujisaki, G Fan, A Miyawaki, RY Tsien, 
unpublished observations). 
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Effects ofpH 

As noted above, wild-type GFP at high pH (11-12) loses absorbance and ex- 
citation amplitude at 395 nm and gains amplitude at 470 nm (8, 57). Such pH 
values, though mechanistically revealing, are almost never encountered in bi- 
ology. Wild-type GFP is also quenched by acidic pH values with an apparent 
pK a near 4.5. Several of the mutants with enhanced spectral properties at pH 
7 are actually more acid sensitive than is the wild type; thus EGFP is 50% 
quenched at pH 5.5 (43). pl^s as high as 6.8 are found in some of the class 
4 mutants with Thr203 replaced by an aromatic residue (J Llopis, RY Tsien, 
manuscript in preparation; R Wachter, SJ Remington, personal communica- 
tion). The mechanistic explanation for these relatively high pK a s is not entirely 
clear. Loss of the Thr203 hydroxyl would indeed be expected to destabilize the 
phenolate form of the chromophore. However, the effect of acid is to quench the 
fluorescence altogether rather than simply shift it toward the short wavelengths 
expected of aprotonated chromophore. The sensitivity of some GFPs to mildly 
acidic pH values carries both advantages and disadvantages. Such GFPs could 
be quenched to a major extent in acidic organelles such as lysosomes, endo- 
somes, and Golgi compartments. The pH sensitivity of some GFPs can also be 
put to good use to measure organellar pH (J Llopis, RY Tsien, manuscript in 
preparation; R Wachter, SJ Remington, personal communication) by targeting 
appropriate GFPs to those locations. 

Effects of Temperature and Protein Concentrations 
Higher GFP concentrations amplify the main excitation peak at 395 nm at the 
expense of the subsidiary peak at 470 nm (8). Because the 395- and 470-nm 
peaks are believed to result from neutral and anionic fluorophores, respectively, 
aggregation probably inhibits ionization of the fluorophores . Increasing temper- 
ature from 15 to 65°C modestly decreases the 395-nm and increases the 470-nm 
excitation peak of mature wild-type GFP. Yet higher temperatures cause denat- 
uration, with 50% of fluorescence lost at 78°C (8). As already mentioned, much 
more modest temperature increases from 20 to 37° C can profoundly decrease 
maturation efficiency of GFPs lacking mutations to improve folding. 

Effects of Prior Illumination 

GFPs have a variety of remarkable abilities to undergo photochemical trans- 
formations, which enables visualization of the diffusion or trafficking of GFP- 
tagged proteins. A defined zone within a cell or tissue is momentarily exposed 
to very bright illumination, which initiates the photochemistry. The subsequent 
fate of the photoconverted protein is imaged over time. At least four distinct 
types of semipermanent photochemical transformation have been reported from 
one or more GFPs: (a) simple irreversible photobleaching, (b) conversion from 
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a 395- to 475-nm excitation maximum, (c) loss of 488-nm-excited fluorescence, 
reversible by illumination at 406 nm, and (d ) generation of rhodamine-like or- 
ange or red fluorescence upon illumination at 488 nm under strictly anaerobic 
conditions (58). 

irreversible photobleaching Photobleaching is the simplest and most 
universal behavior of fluorophores. Most GFPs are relatively resistant to pho- 
tobleaching (22, 59), perhaps because the fluorophore is well shielded from 
chemical reactants such as 0 2 . The bleach rate of the prototypic class 2 GFP, 
the S65T mutant, was reported to be relatively indifferent to equilibration with 
0-100% oxygen or addition of quenchers of triplet states, singlet oxygen, and 
radicals (59) . Nevertheless, with sufficient laser power, photobleaching is easily 
observed and exploited for measurements of fluorescence recovery (37, 59, 60). 
The class 6 mutants (BFPs) are generally more photosensitive than classes 1-5 
(28). Cell-permeant antioxidants may be helpful in protecting such GFPs from 
bleaching. An example is Trolox, 6-hydroxy-2,5,7,8-tetramethylchroman-2- 
carboxylic acid, a water-soluble vitamin E analog commercially available from 
Akfrich Chemical Co., Milwaukee, WI. 

SHIFTING TO A LONGER-WAVELENGTH EXCITATION PEAK This behavior is 
characteristic of wild-type and other class 1 GFPs (11,22, 36). As discussed 
previously, the mechanism is probably a light-driven proton transfer from the 
neutral chromophorc to the carboxylate of Glu222, yielding an anionic chro- 
mophore and a protonated Glu222 (32). This UV-induced enhancement of the 
blue excitation peak has been exploited to measure lateral diffusion of GFP- 
tagged proteins (37). Because the proton transfer is mediated by Thr203 and 
Ser205, mutation of those residues might be a promising way to enhance this 
photochromic effect. Indeed, UV irradiation of the double mutant T203S, 
S205T increases the amplitude of its long- wavelength excitation peak 11.8- 
fold, whereas wild- type GFP under the same conditions increases by at most 
3.6-fold (R Heim, RY Tsien, unpublished information). 

REVERSIBLE LOSS OF LONGER-WAVELENGTH EXCITATION PEAK; SINGLE MOLE- 
CULE detection The opposite behavior, a shift from a longer to a shorter 
excitation wavelength, seems to occur in class 4 mutants. Upon intense laser 
illumination and observation of the fluorescence from single molecules immobi- 
lized in a polyacrylamide gel, such mutants both blink reversibly on a time scale 
of seconds and switch the fluorescence off over tens of seconds (61). However, 
the apparent bleaching can be reversed by illumination at short wavelengths 
such as 406 nm. Probably the chromophore, which is normally mostly anionic, 
can eventually be driven into a protonated state with an excitation maximum 
near 405 nm, whereupon it appears nonfluorescent and bleached to the probe 
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laser at 488 nm. However, excitation of the protonated state then restores the 
normal anionic state. Such cycling can be repeated many times with apparently 
no fatigue, so that it potentially represents a basis for an optical memory at the 
single molecule level. It might also be particularly advantageous for multiple 
determinations of diffusion or trafficking on the same region of interest. 

ANAEROBIC PHOTOCONVERSION TO A RED FLUORESCENT SPECIES A variety 
of GFPs, including wild-type, S65T, and EGFP, undergo a remarkable photo- 
conversion to a red fluorescent species under rigorously anaerobic conditions, 
for example, in microorganisms that have exhausted the oxygen in the medium, 
or in the presence of oxygen scavengers such as glucose plus glucose oxidase 
and catalase (58). The nature of this species emitting at 600 nm remains to be 
clarified. This effect has been used to measure the diffusibility of GFP in live 
bacteria. One complication is that the red emission develops with an exponential 
time constant of about 0.7 s after the illuminating flash (58). 

EXPRESSION, FORMATION, MATURATION, 
RENATURATION, AND OBSERVATION 
Promoters, Codon Usage, and Splicing 

The expression level and delegability of GFP depend on many factors, the most 
important of which are summarized in Table 2. Obviously the more copies of 
the gene and the stronger the promoters/enhancers driving its transcription, the 
more protein that will be made per cell. In plants, it has been important to alter 
the original codon usage to eliminate a cryptic splice site (62). Codons have 
also been altered to conform to those preferred in mammalian systems (63, 64) 
and in the pathogenic yeast Candida albicans (65). Some authors have found 
such codon alterations to improve expression levels in mammalian systems 
(44, 63, 64), whereas others have found little improvement (43). Because the 
mammalianized genes are now widely available and may well be beneficial, they 
might as well be incorporated into all new GFP constructs for use in vertebrates. 
Our impression is that mammalian codons do not hurt expression levels in 
bacteria. Yet another improvement for mammalian systems is the inclusion 
of an optimal ribosome-binding site, also known as a Kozak sequence for 
translational initiation (66). Such a sequence requires insertion of an additional 
codon immediately after the starting methionine, as shown in Figure 1. The 
additional valine or alanine (40) does not seem to interfere with protein function ; 
we prefer to number it la so that the numbering of the subsequent amino acids 
continues to correspond with the wild- type numbering. GFP can be expressed 
with reasonable efficiency in a cell-free in vitro translation system (67), a finding 
that confirms the protein can fold autonomously. 
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Table 2 Factors affecting the detectability of green fluorescent protein (GFP) 

Total amount of GFP (picked out by antibodies, or by position on gel if GFP is abundant enough) 
Number of copies of gene, duration of expression 
Strength of transcriptional promoters and enhancers 
Efficiency of translation including Kozak sequence and codon usage 
Absence of mRNA splicing, protein degradation and export 
Efficiency of posttranslational fluorophore formation 
Solubility vs. formation of inclusion bodies 
Availability of chaperones 

Hindrance to folding because of unfortunate fusions to host proteins 

Time, temperature, availability of Q 2 , and intrinsic rate of cyclization/oxidatton 
Molecular properties of mature GFP 

Wavelengths of excitation and emission 

Extinction coefficient and fluorescence quantum yield 

Susceptibility to photoisomerizarion/bleachmg 

Dimerization 
Competition with noise and background signals 

Autofluorescence of cells or culture media at preferred wavelengths 

Location of GFP, diffuse vs. confined to small subregions of cells or tissues 

Quality of excitation and emission filters and dichroic mirrors 

Sensitivity, noise, and dark current of photodetector 



Folding Mutations and Thermotolerance 

As mentioned previously, several mutations improve the ability of GFP to fold 
at temperatures above those to which the jellyfish would have been exposed. 
Most of them replace bulky residues with smaller ones . Their scattered locations 
throughout the three-dimensional structure of the mature protein (Figure 6) 
give little hint as to why they should help folding and maturation. Of course, 
the X-ray structures are all determined on well-folded mature proteins. At 
the stage when the mutations are needed, the protein is presumably less well 
ordered. As an alternative to mutating the GFP, the presence of chaperones 
can also help GFP fold (68). Aside from the potential technical importance 
of providing chaperones, this finding makes GFP a useful substrate for testing 
chaperone function (69), since GFP provides a continuous nondestructive assay 
for successful folding. 

Requirement for 0 2 

The requirement for 0 2 to dehydrogenate the a,p bond of residue 66 (21 , 23, 24) 
means that GFP probably cannot become fluorescent in obligate anaerobes'. So 
far this is the only fundamental limitation on the range of systems in which GFP 
can be expressed. Once GFP is matured, 0 2 has no further effect (59). The 
oxidation seems to be the slowest step (Figure 2) in the maturation of GFP (23), 
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so it imposes the ultimate limit on the ability of GFP fluorescence to monitor 
rapid changes in gene expression. Considering its importance, surprisingly lit- 
tle work has been done on how to accelerate this oxidation. The dependence of 
rate on oxygen pressure has not been characterized, so it is unknown whether 
p0 2 higher than ordinary atmospheric would speed up the reaction. The mutant 
S65T has been reported to oxidize with an exponential time constant of 0.5 h, 
4 times faster than the 2 h for wild-type protein under parallel conditions (25)! 
Strong reductants such as dithionite can decolorize mature GFP (24, 52), prob- 
ably by rehydrogenating the chromophore. Such reduction may also require 
that the GFP become denatured to allow access to the buried chromophore (23). 

Histology in Fixed Tissues 

Although the prime advantage of GFP is its ability to generate fluorescence in 
live tissue, its fluorescence does survive glutaraldehyde and formaldehyde fix- 
atives (11). Occasional problems in maintaining fluorescence during fixation 
may result from uncontrolled acidity of the fixative solution or the use of exces- 
sive organic solvents, which denature the protein and destroy the fluorescence 
(7)- 



PASSIVE APPLICATIONS OF GFP 

Cell biological applications of GFP may be divided into uses as a tag or as an 
indicator. In tagging applications, the great majority to date, GFP fluorescence 
merely reflects levels of gene expression or subcellular localizations caused 
by targeting domains or host proteins to which GFP is fused. As an indicator, 
GFP fluorescence can also be modulated posttranslationally by its chemical 
environment and protein-protein interactions. 

Reporter Gene, Cell Marker 

The first proposed application of GFP was to detect gene expression in vivo 
(11), especially in the nematode Caenorhabditis elegans, whose cuticle hinders 
access of the substrates required for detecting other reporter genes. GFP was 
particularly successful at confirming the pattern of expression of the mec-7 
promoter, which drives the formation of ^-tubulin in a limited number of 
mechanosensory neurons. GFP's independence from enzymatic substrates is 
likewise particularly promising in intact transgenic embryos and animals (70- 
76) and for monitoring the effectiveness of gene transfer (48, 77, 78). However, 
GFP seems to need rather strong promoters to drive sufficient expression for 
detection, especially in mammalian cells. Most published examples, even those 
using brightened GFPs with mutations to promote folding at 37°C, have used 
constitutive promoters from viruses such as cytomegalovirus (CMV), SV40, 
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or HIV long terminal repeat (79), or strong exogenous regulators such as the 
tetracycline transactivator system (80, 81), rather than native genetic response 
elements modulated by endogenous signals. 

The somewhat disappointing sensitivity of GFP as a so-called gene-tag is 
probably an inherentresult of its lack of amplification. GFP is not an enzyme that 
catalytically processes an indefinite number of substrate molecules. Instead, 
each GFP molecule produces at most one fluorophore. It has been estimated 
that 1 fiM well-folded wild-type GFP molecules are required to equal the 
endogenous autoftuorescence of a typical mammalian cell (55), that is, to double 
the fluorescence over background. Mutant GFPs with improved extinction 
coefficients might improve this detection limit six- to tenfold (43) (see also 
Table 1), but 0.1 /iM GFP is still approximately 10 5 copies per typical cell 
of 1-2 pL volume. This estimate already assumes perfect GFP maturation; 
imperfect or incomplete maturation would raise the threshold copy number 
even further. The ultimate sensitivity limit is set not by instrumentation but by 
cellular autofluorescence. 

If cytosolic GFP is inadequately sensitive as a reporter gene, two alterna- 
tives should be considered. If the gene product can be detected by microscopic 
imaging with subcellular resolution, then targeting the GFP to a defined sub- 
compartment of the cell can greatly reduce the number of molecules required. 
The GFP becomes highly concentrated, and the surrounding unlabeled region 
of the cell provides an internal reference for the autofluorescence background, 
which is usually diffuse in each cell. It is far easier to see local contrast within 
a cell than to quantitatc a cell's average fluorescence relative to unlabeled stan- 
dards. Thus as few as 300-3000 GFPs packed into a centrosome are readily 
visible as a green dot inside a cell (82). However, compartmentation of the GFP 
does not help with nonimaging detection methods such as fiuorometry in cuvets 
or microtiter plates or fluorescence-activated cell sorting (FACS). A different 
solution is to use reporter gene products that can enzymatically catalyze a large 
change in the fluorescence of substrates that can be loaded into intact, fully 
viable cells. For example, the bacterial enzyme ^-lactamase can be detected at 
levels as low as 60 pM in single mammalian cells (50 molecules per cell) with 
substrates loaded as membrane-permeant esters (83). 

Fusion Tag 

The most successful and numerous class of GFP applications has been as a 
genetic fusion partner to host proteins to monitor their localization and fate. The 
gene encoding a GFP is fused in frame with the gene encoding the endogenous 
protein and the resulting chimera expressed in the cell or organism of interest. 
The ideal result is a fusion protein that maintains the normal functions and 
localizations of the host protein but is now fluorescent. The range of successful 
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fusions is now much greater than previously tabulated (22). Not all fusions are 
successful, but the failures are almost never published, so it is difficult to assess 
the overall success rate. GFP has been targeted successfully to practically 
every major organelle of the cell, including plasma membrane (37, 84-87), 
nucleus (28,38, 87-92), endoplasmic reticulum (50,60,93), Golgi apparatus 
(93), secretory vesicles (39,94), mitochondria (28,89,95,96), peroxisomes 
(97), vacuoles (98), and phagosomes (99). Thus the size and shape of GFP 
and the differing pHs and redox potentials of such organelles do not seem 
to impose any serious barrier. Even specific chromosomal loci can be tagged 
indirectly by inserting multiple copies of Lac operator sites and decorating them 
with a fusion of GFP with the Lac repressor protein (100). In general, fusions 
can be attempted at either the amino or carboxyl terminus of the host protein, 
sometimes with intervening spacer peptides. However, the crystal structures of 
GFP (30, 31) show that the N- and G-terminii of its core domain are not far apart, 
so it might be possible to splice GFP into a noncritical exterior loop or domain 
boundary of the host protein. For example, residues 2-233 of GFP have been 
inserted between the last transmembrane segment and the long cytoplasmic tail 
of a Shaker potassium channel (100a). 

GFP AS AN ACTIVE INDICATOR 

The rigid shell in GFP surrounding the chromophore enables it to be fluorescent 
and protects it from photobleaching but also hinders environmental sensitivity, 
Nevertheless, GFPs that act as indicators of their environment have been created 
by combinations of random and directed mutagenesis. The pH sensitivity of 
certain mutants and their potential application to measure organellar pH have 
already been mentioned. It is possible to engineer phosphorylation sites into 
GFP such that phosphorylation produces major changes in fluorescence under 
defined conditions (AB Cubitt, personal communication). The engineered fu- 
sion of GFP within the Shaker potassium channel is the first genetically encoded 
optical sensor of membrane potential (100a). Depolarization causes at most a 
5% decrease in fluorescence with a time constant of approximately 85 ms, but 
both the amplitude and speed may well improve in future versions. Bu t the most 
general way to make biochemically sensitive GFPs is to exploit fluorescence 
resonance energy transfer (FRET) between GFPs of different color. FRET is 
a quantum-mechanical phenomenon that occurs when two fluorophores are in 
molecular proximity (<100 A apart) and the emission spectrum of one fluo- 
rophore, the donor, overlaps the excitation spectrum of the second fluorophore, 
the acceptor. Under these conditions, excitation of the donor can produce emis- 
sion from the acceptor at the expense of the emission from the donor that would 
normally occur in the absence of the acceptor. Any biochemical signal that 
changes the distance between the fluorophores or relative orientation of their 



M_AY. 19. 2004 9:35AM Mail INFOGATE-IE INFO EXP NO. 7 9 5 1 P. 28/37 

EXHIBIT A 



GREEN FLUORESCENT PROTEIN 535 

transition dipoles will modulate the efficiency of FRET (101-103). Because 
FRET is a through-space effect, it is not necessary to perturb either GFP alone 
but rather only the linkage or spatial relationship between them. The potential 
utility of FRET between GFPs was the main motivation for the development of 
most of the mutations in Table 1. The change in ratio of acceptor to donor emis- 
sions is nearly ideal for cellular imaging and flow cytometry because the two 
emissions can be obtained simultaneously and their ratio cancels out variations 
in the absolute concentration of the GFPs, the thickness of the cell, the bright- 
ness of the excitation source, and the absolute efficiency of detection. Because 
the sample need be excited at only one wavelength, which should preferentially 
excite the donor, FRET is ideal for laser-scanning confocal microscopy and 
FACS (103). FRET also causes changes in donor fluorescence lifetime and 
bleaching rate (104), but detection of those signals either requires much more 
sophisticated instrumentation or is destructive. 

Protease Action 

The simplest and first-demonstrated way to achieve and modulate FRET be- 
tween GFPs was to fuse a blue-emitting (class 6) GFP mutant (i.e. a BFP) to a 
phenolate-containing (class 2) green GFP via an intervening protease-sensitive 
spacer (41, 105). The broad emission spectrum of the donor BFP, peaking 
at 447 nm, overlaps fairly well with the excitation spectrum of the class 2 
GFP, peaking at 489 nm. Wild-type GFP would not be satisfactory, because 
the 383 nm used to excite the BFP would directly and efficiently excite (he 
395-nm excitation peak of wild-type GFP even in the absence of FRET. The 
tandem fusion exhibits FRET, which is then disrupted when a protease is added 
to cleave the spacer and let the GFPs diffuse apart. Heim & Tsien (41) used 
P4-3 and S65C or S<55T and a trypsin- or enterokinase-sensitive 25-residue 
linker, and achieved a 4.6-fold increase in the ratio of blue to green emissions 
resulting from protease action. Control experiments verified mat the two GFPs 
were unaffected by the proteases at the concentrations used, so the spectral 
change reflected cleavage of the linker. Mitra et al (105) used BFP5 (F64M, 
Y66H) and RSGFP4 with a Factor X a -sensitive linker and obtained a 1. 9-fold 
increase in the analogous ratio. Although FRET-based assays for proteases are 
well known (106, 107), synthetic peptide substrates are limited in length and 
useful only in vitro. The special advantages of GFP-based constructs are that 
they could incorporate full-length protein substrates and could be expressed 
and assayed inside live cells or organisms. 

Transcription Factor Dimerimtion 

A static homodimerization of the transcription factor Pit-1 has been detected 
by coexpression of BFP-Pit-1 and GFP-Pit-1 fusions in HeLa cells. Homo- 
dimerization is inherently more difficult than heterodimerization to demonstrate 
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by FRET, because at most 50% of the complexes will combine BFP- and GFP- 
labeled proteins, while nonproductive BFP-BFP and GFP-GFP complexes will 
each account for 25% of the homodimers. Nevertheless, careful spectral anal- 
ysis indicated that homodimerization was detectable by FRET (108). Unfor- 
tunately, no modulation of the Pit-1 interaction or new biological conclusions 
were reported. 

Ca 2+ Sensitivity 

The first dynamically responsive biochemical indicators based on GFP are Ca 2+ 
sensors, independently developed almost simultaneously by Romoser et al (109) 
and by Miyawaki et al (50). Romoser et al linked commercially available class 
6 BFP and class 2 GFP mutants with a 26-residue spacer containing the calmod- 
ulin (CaM)-binding domain from avian smooth muscle myosin light chain ki- 
nase. This spacer allowed FRET to occur from the BFP to the GFP, perhaps 
because it was long and flexible enough for the two GFPs to dimerize. Addition 
of Ca 2+ -CaM disrupted FRET, presumably by binding to and straightening the 
linker so that the two GFPs were unable to dimerize. Such binding of Ca 2+ - 
CaM decreased the 505 nm emission by 65% and the ratio of 505- to 440-nm 
emissions by sixfold in vitro, an impressive spectral change for a reversible 
conformational change less drastic than proteolytic cleavage. The bacterially 
expressed recombinant protein was then microinjected into individual HEK- 
293 cells. In such intact cells, elevations of cytosolic free Ca 2+ produced 
much more modest decreases (5-10%) in 510-nm emission, which could be 
amplified to about 30% decreases if exogenous calmodulin was co-injected. 
Thus the response of the indicator in cells was limited by CaM availability, im- 
plying that the indicator is responsive to cellular Ca 2+ -CaM rather than Ca 2+ 
per se. Because the heterologously expressed protein had to be microinjected, 
the unique ability of GFP to be continuously synthesized by the target cell was 
not exploited, and only cytosolic signals could be monitored. 

Miyawaki et al (50) fused BFP or class 5 cyan-fluorescent protein (CFP) to 
the N-terminus of CaM, and class 2 GFP or class 4 yellow-fluorescent protein 
(YFP) to the C-terminus of M13, the CaM-binding peptide from skeletal mus- 
cle myosin light chain kinase. The CFP-CaM and the Ml 3- YFP could either 
be fused via two glycines (in which case all four protein domains were joined 
into a 76-kDa tandem chimera) or left separate. In either case, binding of Ca 2+ 
to the CaM caused it to grab the M13, thus increasing FRET, the opposite 
spectral effect from that of Romoser et al (109). By using GFPs with mutations 
to optimize mammalian expression, the indicators were bright enough to be 
introduced into cells by DNA transfection rather than protein microinjection. 
Because the four-domain chimeras were expressed in situ, they could readily be 
targeted to organelles such as the nucleus or endoplasmic reticulum by addition 
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Table 3 Advantages and disadvantages of GFP-based Ca 2+ indicator 0 



Advantages 

Applicable to nearly all organisms; no need for ester permeation and hydrolysis 
Can be targeted to specific tissues, cells, organelles, or proteins 
Unlikely to diffuse well enough to blur spatial gradients 
Modular construction is readily modified/improved by mutagenesis 
Good optical properties: visible excitation, emission ratioing, high photostability 
cDNAs or improved sequences are cheap to replicate and distribute 
Should be generalizable to measure many bioactive species other than Ca 2+ , 
as long as a conformational!)' sensitive receptor is available 
Disadvantages 

The maximum change in emission ratio is currently less than for small-molecule dyes 

The binding kinetics are somewhat slower 

The CaM or M13 might have some additional biological activity 

•GFP, green fluorescent proteins. ' 



of appropriate targeting sequences. Ca 2+ affinities were readily adjustable by 
mutation of the CaM. Thus free Ca 2+ concentrations in the endoplasmic retic- 
ulum were measured to be 60-400 fiM in unstimulated cells, decreasing to 
1-50 iiM in cells treated with Ca 2+ -mobilizing agonists. Advantages and dis- 
advantages of the GFP-based Ca 2+ indicators (called cameleons) compared to 
conventional Ca 2+ indicators are summarized in Table 3 . 

The constructs with separate CFP-CaM and M13-YFP proved that FRET 
between GFP mutants can dynamically monitor protein-protein interaction in 
single living cells. Binding of the two host proteins to each other brings their 
fused GFPs into proximity and enhances FRET. By comparison the mech- 
anism of Romoser et al (109) requires that mutual binding must substan- 
tially change the distance between N- and C-terminii of at least one of the 
partners. 

In principle the use of FRET offers some major advantages and disadvantages 
over other current methods for detecting protein-protein interaction (Table 4). 
The most unique advantages are the spatial and temporal resolution and the 
ability to observe the proteins in any compartment of the cell. The biggest dis- 
advantage is that even in the absence of any protein interaction, a substantial 
background signal is present when illuminating at the donor's excitation max- 
imum and observing at the acceptor's emission maximum. This background 
signal arises because the donor emission has a tail that extends into the accep- 
tor's emission band, and the acceptor excitation has a tail that extends into the 
donor's excitation band. For these reasons, FRET is probably not a suitable 
method for detecting trace interactions or fishing for unknown partners but 
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Table 4 Advantages and disadvantages of FRET between (GFPs) to monitor protein inter- 



Advantages 

Works in vitro and in living mammalian cells, not just yeast 
Can respond dynamically to posttranslational modifications 
Has high temporal (milliseconds) and spatial (submicron) resolution 
Interacting proteins can be anywhere in the cell and do not need to be sent to nucleus 
Degree of association can be quantified, if 0% and 100% binding can be established in situ 
Efficiency of FRET at 100% complexation gives some structural information 
Disadvantages of intermolecular FRET 

Must express fusion proteins, in which host protein and GFPs must both remain functional 
If GFPs are too far from each other (»80 A) or unluckily oriented, FRET will fail 
Even with no association, spectral overlap contributes some signal at the FRET wavelengths 
Trace or rare interactions will be hard to detect 

Need negative and positive controls, i.e. reference conditions of 0% and 1 00% association 
Homodimerization is more difficult to monitor than heterodimerization 



is best used at a later stage when the two host proteins are molecularly well 
characterized and the detailed spatioLcmporal dynamics of their interaction is 
to be determined in live cells. 

What Are the Best FRET Partners? 

The efficiency of FRET is given by the expression R§/(R§ + r 6 ), where r is the 
actual distance between the centers of the chromophores and R 0 is the distance 
at which FRET is 50% efficient. R 0 depends on the quantum yield of the donor, 
the extinction coefficient of the acceptor, the overlap of the donor emission and 
acceptor excitation spectra, and the mutual orientation of the chromophores 
(102). The early attempts to obtain FRET between GFPs all used BFPs (class 
6 mutants) as donors and class 2 (phenolate anion) GFPs as acceptors because 
these were the first available pairs with sufficiently distinct wavelengths. For 
such pairs, Ro is calculated to range from 40 to 43 A. However, the poor 
extinction coefficients, quantum yields, and photostabilities of the BFPs have 
convinced us (50) that cyan mutants (class 5) are much better donors. The 
acceptor correspondingly must become a class 4 yellow mutant so that its 
excitation spectrum overlaps the donor emission as much as possible, whereas 
the two emissions remain as distinct as possible. Combinations of cyan donors 
with yellow acceptors have R 0 values of 49-52 A and are our currently preferred 
donor-acceptor pairs. So far, the highest value of R 0 between two GFPs is 60 A 
for class 3 mutant H9-40 as donor to Topaz, but unfortunately these mutants' 
emission spectra are too close to each other for good discrimination. 

The above calculations have assumed that the GFPs are randomly oriented 
or tumbling with respect to one another, which is the conventional assumption 
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made in calculating R 0 (102). If instead the mutual orientation of the two 
chromophores were the same as in the crystal structure for the wild-type dimer 
(31), the Ros would be about 72% of the previously calculated values, for ex- 
ample, 35-37 A for cyan-to-yellow FRET. For comparison, the actual distance 
r between the centers of the chromophores in the dimer is about 25 A, which 
would predict that FRET would occur with about 90% efficiency in such a direct 
heterodimer between cyan and yellow mutants. 

OUTLOOK FOR FUTURE RESEARCH 

Despite all that has been learned about how GPP works and how it can be 
exploited as a research tool, enormous challenges and opportunities remain. 
Listed below are some unanswered general questions about GFP that are among 
the most intriguing, excluding problems related to narrow applications in cell 
biology: 

Cloning of Related GFPs 

What are the genetic sequences and structures of GFP homologs from biolu- 
minescent organisms other thanAequorea? This information would illuminate 
the evolution of fluorescent proteins, reveal the essential conserved elements 
of the structure, and provide the genetic raw material for combinatorial mixing 
and matching to produce hybrid proteins with new phenotypes. Renilla GFP is 
the most obvious next cloning target, but even more bioluminescent organisms 
should be investigated. 

Protein Folding and Chromophore Folding 

We need to know much more about how GFP folds into its yS-barrel confor- 
mation and synthesizes its internal chromophore. Now that many of the steps 
have been kinetically resolved (23), the effect of mutations on each of the steps 
needs to be determined. The most informative mutants will not be the majority 
that completely prevent the formation of fluorescence, because those could act 
anywhere in the entire cascade including disruption of the final state. Instead, 
mutants or chaperonins that affect the rates but not the final extent of fluores- 
cence development are likely to be most valuable. The molecular mechanism, 
kinetics, and byproducts of chromophore formation by 0 2 are particularly crit- 
ical questions. 

Altered Wavelengths of Fluorescence 

Yet longer wavelengths of excitation and emission than are currently available 
from the class 4 (^-stacked phenolate) mutants (Table 1) would be useful for 
multiple labels and reporters and to serve as resonance energy transfer acceptors. 
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For example, an increase in excitation maximum to 540-550 nm would permit 
efficient energy transfer from terbium chelates, whose millisecond excited-state 
lifetimes make them useful as energy transfer donors (110). The 560-570 nm 
emission from such longer- wavelength GFP mutants would also be distinct 
enough from the standard class 2 (phenolate) mutants to make such green- 
orange pairs useful as FRET partners. Another approach to improving FRET 
would be to reduce the emission bandwidth of the class 5 (tryptophan-based) 
cyanmutants and thereby improve the quantitative separation of cyanand yellow 
emissions. Emission spectral alterations should be most easily screened by 
fluorescence-activated cell sorting (FACS). The chemical structure of the red 
fluorescent species formed by intense illumination in anaerobic conditions (58) 
must be determined as the first step in making this extraordinary photochemical 
reaction more general and useful. 

Altered Chemical and Photochemical Sensitivities 
The sensitivity of GFP spectra to environmental factors such as pH and past 
illumination is valuable if pH indication or photochemical tagging is desired 
but is a nuisance for most other applications. Therefore we need to understand 
the molecular mechanisms of such environmental modulations and to find mu- 
tations that enhance or eliminate those mechanisms. In many cases, it will 
be important to use screening methods that, unlike FACS, permit longitudi- 
nal comparison of individual cells or colonies before and after a chemical or 
actinic challenge. Digital imaging of colonies on plates (e.g. 1 1 1) is likely to 
be advantageous. 

Fusions Other Than atN- or C-Terminus 

Almost all fusions of host proteins with GFP have been simple tandem fusions 
in which the C-terminus of one protein is genetically concatenated to the N- 
terminus of the other. Because not all such fusions work, general rules are 
needed in order to predict when fluorescence will be intact and when host protein 
function will be preserved. Sometimes neither order of simple concatenation 
produces functional chimeras. Could splicing of GFP into the middle of the 
host protein be made easier and more general? It might be helpful to engineer 
GFP to move its N- and C-terminii as close to each other as possible, perhaps 
by addition of spacers or by circular permutation. 

Alternatives to Fluorescence 

Chromophores can be harnessed to perform many functions other than fluores- 
cence, such as phosphorescence (emission from the triplet state), generation of 
reactive oxygen species such as singlet oxygen or hydroxyl radical, and photo- 
chemical cleavage. Can GFP be engineered to do such tricks? Phosphorescence 
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typically gives lifetimes in microseconds rather than nanoseconds and there- 
fore permits exploration of protein dynamics on longer time scales. Controlled 
generation of singlet oxygen can be useful to polymerize diaminobenzidine 
locally into a polymer visible by electron microscopy, so that the location of 
fiuorophores can be verified at ultrastructura] resolution (112). Laser pulses 
can be used to kill proteins within a few nanometers of a suitable chromophore 
that generates hydroxyl radicals or other reactive species (113). Photochemical 
cleavage is the basis of important methods to produce sudden changes in the 
concentration of signaling molecules (114). These techniques would be revo- 
lutionized if their crucial molecules could be synthesized or at least localized 
in situ under molecular biological control, in the same way as GFP. Of course, 
the rigid shell protecting the chromophore of GFP from the environment may 
intrinsically prevent even mutagenized GFPs from fulfilling such alternative 
functions, so that completely different proteins may need to be found or de- 
vised. Perhaps GFP will becomejust one prototype of a collection of genetically 
encoded, light-driven macromolecular reagents. 
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